We study partition of networks into basins of attraction based on a steepest ascent search for the
node of highest degree. Each node is associated with, or "attracted" to its neighbor of maximal
degree, as long as the degree is increasing. A node that has no neighbors of higher degree is a
peak, attracting all the nodes in its basin. Maximally random scale-free networks exhibit different
behavior based on their degree distribution exponent $\gamma$: for small $\gamma$ (broad distribution) networks
are dominated by a giant basin, whereas for large $\gamma$ (narrow distribution) there are numerous
basins, with peaks attracting mainly their nearest neighbors. We derive expressions for the first
two moments of the number of basins. We also obtain the complete distribution of basin sizes
for a class of hierarchical deterministic scale-free networks that resemble random nets. Finally, we
generalize the problem to regular networks and lattices where all degrees are equal, and thus the
attractiveness of a node must be determined by an assigned weight, rather than the degree. We
derive the complete distribution of basins of attraction resulting from randomly assigned weights in
one-dimensional chains.

PACS numbers: 89.75.Hc,89.75.Fb,02.50.-r 



I. INTRODUCTION 



Networks often have heterogeneous structure, with different nodes varying widely in their
connectivity and in their roles. The problem of identifying these roles and assigning nodes
to communities or modules based on their function is of great interest, and many methods
and algorithms have recently been proposed. These methods aim to combine knowledge of the
network's global structure with information about the nodes' local connections, to generate
a network partition. However, on a more fundamental level, nodes can be simply distinguished
according to the node that is their "authority", "attractor", or, in heterogeneous networks,
their "hub".

The hub that each node belongs to is found by moving recursively onto the neighbor of
highest degree, or number of connections, until the hub is reached: a node whose degree is
greater than that of all of its neighbors. Classifying nodes by their hubs leads to a natural
partition of the network into basins of attraction. See Fig. 1 for a schematic illustration.
This partitioning provides a quick and easy way to classify nodes based on their relation
with the network's major players, without resorting to external information.

In general, when each node is associated with a value of a scalar field, a "gradient network"
emerges by replacing all the links that emanate from a node by a single directed link that
points to the node's neighbor with the highest value of the field [18, 19]. Thus, recursively
following nodes of highest degree is equivalent to traversing the "gradient network" formed
by considering the scalar field defined by the degrees of the nodes. Many properties of
gradient networks have been studied, such as the emerging degree distribution and its
relation to the original network topology, and the possibility of congestion when too few
nodes are receiving the flow that is generated by the gradient [18, 19]. Gradient networks
have also proved useful in the analysis of energy landscapes [20], and as the basis for new
and improved synchronization [21] and routing [13] methods. Here, we focus on the specific
case where the value associated with each node is the degree, and thus does not require any
external information but the bare topology. The walk up the degree gradient identifies each
node with one of the network hubs.

The decomposition into steepest-ascent basins is of interest in many systems. For example,
suggested routing schemes in communication networks involve transmitting all packets through
the hub nearest to the source [24]. The size of the basins delimits the performance of such
a routing algorithm. In a different field, an analysis of the energy landscape's network of
atomic clusters shows that the energy of a configuration, or a node, decreases with the
number of configurations kinetically connected to it, which is its degree [25]. Thus, as the
system is cooled and its energy decreases, configurations with higher degrees tend to be
visited. The actual partitioning into basins determines roughly whether the system would
inevitably end up in the "ideal glass state" or arrive at one of many meta-stable states,
depending on the initial conditions.

The topology of the basins is also important if one is interested in a local strategy for
finding the most connected node. A network with a single basin would make a steepest ascent
search (in the "degree space") successful, while a more complicated topology would require a
more sophisticated approach. Finally, the properties of the basins of attraction can be used
to classify networks with similar degree distributions but otherwise different topology and
function.

FIG. 1: (Color online) Decomposition of a network into basins of attraction. The number
inside each node indicates its degree, and the arrows point from a node to its attractor,
the neighbor with highest degree. Nodes with no neighbors of higher degree are peaks and are
highlighted in the schematic. A peak represents a basin of attraction: all nodes which are
attracted to it belong to its basin of attraction. The basins in this schematic have
different background colors.

The algorithmic aspects of the partition method are relatively simple and will be discussed
briefly below. Our main goal is to study, analytically and numerically, the statistical
properties of the basins of attraction in ensembles of maximally random scale-free (SF)
networks. We find that the topology of basins (i.e., their number, sizes, hubs' degree, etc.)
shows a strong dependence on the degree distribution, and we quantify this behavior. We then
study the basins' topology in a class of deterministic hierarchical SF networks [28], and
show that it reflects some prominent properties of the random networks. Finally, we
generalize the problem to the case where the "attractiveness" of each node is determined by
a random number, a "height", rather than by its degree, and derive analytical results for the
basins of attraction in regular one- and two-dimensional lattices.

II. DEFINITIONS 

We focus on (undirected) SF networks, i.e., networks in which node degree is broadly
distributed, usually in the form $\mathcal{P}(k) \sim k^{-\gamma}$ ($m \le k < N$), where $k$ is the
degree, $m$ is the minimum possible degree, $N$ is the total number of nodes, and
$\gamma > 2$ is the degree exponent. Many real world networks were shown to be scale-free
with $\gamma < 3$. We nevertheless study networks with $\gamma \in [2, 5]$, so as to reach the
near-homogeneous limit where the degree is narrowly distributed. Our networks are static and
maximally random, generated according to the configuration model [30]: we first draw nodes'
degrees based on the prescribed distribution, then randomly connect open links until all
nodes have all of their links connected.
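
The generation procedure can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes a power law truncated at a hypothetical cutoff `k_max`, and it assumes that self-loops and repeated links produced by the random matching are simply discarded, a detail the text does not specify. The function names are illustrative.

```python
import random
from collections import defaultdict


def sample_degrees(n, gamma, m=1, k_max=None, rng=random):
    """Draw n degrees from P(k) ~ k**(-gamma) on m <= k <= k_max."""
    k_max = k_max if k_max is not None else n - 1
    ks = list(range(m, k_max + 1))
    weights = [k ** (-gamma) for k in ks]
    degrees = rng.choices(ks, weights=weights, k=n)
    # The total number of open links must be even to pair them all
    # (assumption: fix parity by adding one link end to the first node).
    if sum(degrees) % 2 == 1:
        degrees[0] += 1
    return degrees


def configuration_model(degrees, rng=random):
    """Randomly pair open link ends; return node -> set of neighbors."""
    stubs = [node for node, k in enumerate(degrees) for _ in range(k)]
    rng.shuffle(stubs)
    adj = defaultdict(set)
    for a, b in zip(stubs[::2], stubs[1::2]):
        if a != b:  # assumption: drop self-loops; sets absorb repeated links
            adj[a].add(b)
            adj[b].add(a)
    return {node: adj[node] for node in range(len(degrees))}
```

Because loops and repeated links are dropped, the realized degree of a node can be slightly smaller than the degree that was drawn for it.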

A precise definition of basins of attraction requires dealing with several ambiguities
(e.g., how to resolve tie-breaks). We opt for the following rules:

1. Start the search from node $i$ with degree $k_i$ and neighbors $j_1, j_2, \ldots, j_{k_i}$.

2. Denote the neighbor that has the highest degree as $j_{\max}$, with degree
$k_{j_{\max}} = \max\{k_{j_1}, k_{j_2}, \ldots, k_{j_{k_i}}\}$. If the highest degree is shared by
more than one neighbor, choose one of them arbitrarily.

3. If $k_i < k_{j_{\max}}$, $i$ is attracted to $j_{\max}$ and both belong to the same basin of
attraction.

4. If $k_i \ge k_{j_{\max}}$, node $i$ is a peak, and is attracted to itself, forming a basin of
attraction with all nodes (if any) that are attracted to it.

5. Repeat for all unassigned nodes as the root of the search. Each node now belongs to
exactly one basin of attraction.

Note that we require $k_i$ to be strictly smaller than $k_{j_{\max}}$ for $i$ to be attracted to
$j_{\max}$; in other words, a node may be a peak even if it has neighbors with equal degree.
This choice saves us from delving into further subtleties. The results are qualitatively the
same independent of the details of the definition (see Appendix A for a short discussion).

A simple and fast partitioning algorithm relies on scanning the nodes in descending degree
order. Then, each node is either designated as a peak, or assigned to the basin of its
neighbor with highest degree. Because we scan by degree order, we are guaranteed that the
neighbor was already assigned to a basin. Thus, the running time of the algorithm (for
sparse networks) is of the order of $N \log N$, the time it takes to sort the nodes [31].
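
A minimal sketch of this scan, assuming the network is given as a dictionary that maps each node to its neighbors (a representation chosen here for illustration; the function name is hypothetical). Ties between equally high neighbors are resolved by whichever neighbor is encountered first, which is one possible reading of "arbitrarily".

```python
def partition_into_basins(adj):
    """Map every node to the peak of its basin.

    adj: dict mapping node -> iterable of neighbors (undirected graph).
    """
    degree = {v: len(adj[v]) for v in adj}
    peak_of = {}
    # Scan nodes from highest to lowest degree (the O(N log N) sort).
    for v in sorted(adj, key=lambda x: degree[x], reverse=True):
        best = max(adj[v], key=lambda x: degree[x], default=None)
        if best is None or degree[best] <= degree[v]:
            peak_of[v] = v  # no strictly higher neighbor: v is a peak
        else:
            # best has strictly higher degree, so it was already scanned
            peak_of[v] = peak_of[best]
    return peak_of
```

For a toy network made of a degree-3 hub and a degree-4 hub joined through two degree-2 nodes, the function assigns each intermediate node to the hub it touches, since that hub is its only neighbor of higher degree.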

III. TOPOLOGY OF BASINS OF ATTRACTION 
IN RANDOM SCALE-FREE NETWORKS 

We attempt to capture the topology of the basins through a few representative quantities
which we define below. Denote the total number of basins by $N_b$. Define the density of
basins as the number of basins per node and denote it as $n_b = N_b/N$. Denote next the basin
size by $s$. The probability of a basin to be of size $s$ is $P(s)$. A related quantity is the
probability of a node to belong to a basin of size $s$: $Q(s) = N_b P(s) s/N = n_b s P(s)$ (and
a particularly interesting case is the probability of a node to be a solitary basin, $Q(1)$).
Both $P$ and $Q$ are normalized probability distributions. Other measures of interest are the
degrees of the peaks and the size of the largest basin $S$.
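
These definitions translate directly into a few lines of code. The sketch below assumes the partition is available as a dictionary mapping each node to the peak of its basin; the function name and this input format are illustrative choices, not part of the original text.

```python
from collections import Counter


def basin_statistics(peak_of):
    """Return (n_b, P, Q) given a node -> peak assignment."""
    n_nodes = len(peak_of)
    sizes = Counter(peak_of.values())        # peak -> basin size s
    n_basins = len(sizes)
    size_counts = Counter(sizes.values())    # s -> number of basins of size s
    n_b = n_basins / n_nodes                 # density of basins, N_b / N
    P = {s: c / n_basins for s, c in size_counts.items()}
    Q = {s: n_b * s * P[s] for s in P}       # Q(s) = n_b s P(s)
    return n_b, P, Q
```

Both returned distributions sum to one, which is a quick sanity check on any partition.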

In the figures below we present simulation results for SF networks with $N = 1000$, minimum
degree $m = 1, 2$ and varying $\gamma$. The following picture emerges from the results. For
small $\gamma$ close to 2, the network is dominated by one hub, attracting most of the nodes
to form a giant basin. Thus, the number of basins is relatively small and the size of the
largest basin is narrowly distributed about $S \sim N$. The sizes of the basins and the
degrees of the peaks show a bimodal distribution: a peak close to $N$ and a fast decay for
small basins which are not included in the giant basin.

For large $\gamma$, a different behavior is observed. The number of basins, $N_b$, is large,
and most of the basins are small. The largest basin is no longer giant, and its average size
scales as $S \sim N^{\delta}$ ($\delta < 1$). The distribution of the degrees of the peaks
approaches the degree distribution of the entire network. The distribution of basin sizes
now exhibits power-law scaling for small $s$: $Q(s) \sim s^{-\alpha}$ (or
$P(s) \sim s^{-(\alpha+1)}$). We term $\alpha$ the basin exponent. The minimal degree $m$
significantly influences the basins count. For $m = 1$ the network is usually fragmented,
and thus many basins can form. For $m \ge 2$ the network is connected and consists of a
single component, so the number of basins is smaller.

FIG. 2: Average density of basins in SF networks. Plotted is $\langle n_b \rangle$ vs. $\gamma$,
as well as the density of solitary basins, $Q(1)$. Simulation results (symbols), for networks
with $N = 1000$ and $m = 1, 2$, are matched by theory, Eqs. (2) and (B2) (solid lines).

FIG. 3: Distribution of basin sizes in SF networks. Plotted is $Q(s)$ for three values of
$\gamma$ ($N = 1000$, $m = 1$). Inset: The basin exponent $\alpha$, characterizing the power-law
decay at small $s$, plotted vs. the degree exponent $\gamma$.

FIG. 4: Degree of the peaks in SF networks. Plotted is the distribution of the degree of the
peaks for three values of $\gamma$ ($N = 1000$ and $m = 1$). Inset: For large $\gamma$, the
exponent $\beta$ characterizing the decay of the pdf (circles) follows $\gamma$ (solid line)
very closely.



IV. THEORY 
A. Random scale-free networks 

1. The giant basin 



The crossover between the two limiting cases of networks with a giant basin and networks
fragmented into many basins is at about $\gamma_c \approx 2.8$. This is revealed by the
behavior of the size of the largest basin $S$: while $S \sim N$ for $\gamma < \gamma_c$, there
is no longer a giant basin for $\gamma > \gamma_c$ and $S \sim N^{\delta}$ with $\delta < 1$
(Fig. 5). The minimum in $\alpha$ also occurs at $\gamma \approx 2.8$ (Fig. 3), and we
hypothesize that it is another reflection of the transition.

The transition between a network with a giant basin and a network fragmented into many
basins is observed in the simulations at about $\gamma \approx 2.8$. Interesting questions are
whether this transition becomes sharp for infinite systems, and what the value of $\gamma_c$
is for $N \to \infty$.

A simple argument suggests that for infinite networks a sharp transition occurs at
$\gamma_c = 3$. To understand that, consider first the probability of a given node $i$ of
degree $k$ to be a peak,

$$\Pr\{i \text{ is a peak} \mid k_i = k\} = \left[ \sum_{k'=m}^{k} \frac{k' \mathcal{P}(k')}{\langle k \rangle} \right]^{k},$$ (1)

where $\mathcal{P}(k)$ is the degree distribution, $\langle k \rangle$ is the average degree, and
$k' \mathcal{P}(k')/\langle k \rangle$ is the probability that a neighboring node (which is a node
followed by a random link) has degree $k'$ [32]. Equation (1) results from the requirement
that none of $i$'s $k$ neighbors have degree higher than $k$. For large $k$, we substitute
$\mathcal{P}(k) \sim A k^{-\gamma}$ and approximate the sum as an integral

$$\sum_{k'=m}^{k} k' \mathcal{P}(k') = \langle k \rangle - \sum_{k' > k} k' \mathcal{P}(k') \approx \langle k \rangle - \frac{A}{\gamma - 2}\, k^{2-\gamma}.$$

For $k \gg 1$, Eq. (1) becomes ($\gamma > 2$)

$$\Pr\{i \text{ is a peak} \mid k_i = k\} \approx \left[ 1 - \frac{B}{k^{\gamma-2}} \right]^{k} \approx \exp\left( -B k^{3-\gamma} \right),$$

where $B = A/[\langle k \rangle (\gamma - 2)]$. Thus, for $\gamma < 3$ the probability of a node to
be a peak is small, and approaches zero for large $k$. Therefore, only the node with the
largest degree in the network can be a peak, and it will attract the giant basin. For
$\gamma > 3$, every node with large degree is almost surely a peak. For even larger $\gamma$,
$\gamma > \gamma^* > 3$ (where $\gamma^*$ is determined by the small $k$ properties of
$\mathcal{P}(k)$) there is no longer a giant component in the network. In that case, the size
of the largest component scales as $N^{1/(\gamma-1)}$ [33]. The maximal degree of the network
has the same scaling. Since the size of the largest basin is at least the maximal degree, but
cannot exceed the size of the largest component, we conclude that for
$\gamma > \gamma^* > 3$, $S \sim N^{\delta}$ with $\delta = 1/(\gamma - 1)$. Simulation results
support this scaling (inset of Fig. 5), but it is not known whether a transition is expected,
for infinite networks, at $\gamma^*$.

FIG. 5: Size of the largest basin. Plotted is the relative size of the largest basin ($S/N$)
vs. $\gamma$ for SF networks with $N = 10^3$, $10^4$, and $10^5$, and $m = 1$. Inset: The largest
basin's exponent $\delta$ ($S \sim N^{\delta}$). For $\gamma < \gamma_c$, $\delta \approx 1$
($S \sim N$ is a giant basin), while for large $\gamma$, $\delta$ approaches $1/(\gamma - 1)$.

Another heuristic argument in favor of the phase transition at $\gamma_c = 3$ is the
following. Consider two nodes with degrees $k_1$ and $k_2$ close to the maximal degree
$K \sim N^{1/(\gamma-1)}$ [34]. The probability that these nodes are connected is proportional
to $k_1 k_2/N$ [35]. Thus, the probability of the two hubs to be connected scales as
$N^{(3-\gamma)/(\gamma-1)}$. Hence, for $\gamma < 3$ the two largest hubs are almost surely
connected. The hub with the larger degree attracts the smaller hub, together with its entire
basin, to form the giant basin. These arguments are supported by simulation results for
increasing values of $N$ (Fig. 5).
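
The exponent in this argument follows from one line of algebra. As a sketch, assume both hub degrees are of the order of the maximal degree, $k_1 \approx k_2 \approx K$, and drop all constant prefactors:

```latex
\[
  \Pr\{\text{hubs connected}\} \;\propto\; \frac{k_1 k_2}{N}
  \;\sim\; \frac{K^2}{N}
  \;\sim\; N^{\frac{2}{\gamma-1}-1}
  \;=\; N^{\frac{3-\gamma}{\gamma-1}} .
\]
```

The exponent is positive for $\gamma < 3$, where the estimate grows with $N$ and the connection probability saturates at one, and negative for $\gamma > 3$, where it vanishes as $N \to \infty$.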



2. Number of basins and basin sizes 

While we could not obtain a complete derivation of $P(s)$ or $Q(s)$ for random static
scale-free networks, it is possible to obtain analytic results for a few chief quantities.
Below we derive an exact expression for $\langle n_b \rangle$ as well as reasonable
approximations for $\mathrm{Var}(n_b)$ and $Q(1)$.

Clearly, the number of peaks is equal to the number of basins. Thus, the average basin
concentration $\langle n_b \rangle = \langle N_b \rangle / N$ is equal to the probability of a node to
be a peak. The probability of a given node $i$ with degree $k$ to be a peak is given in
Eq. (1). If the degree of $i$ is not specified, we must average over all possible degrees.
Thus,

$$\langle n_b \rangle = \sum_{k \ge m} \mathcal{P}(k) \left[ \sum_{k'=m}^{k} \frac{k' \mathcal{P}(k')}{\langle k \rangle} \right]^{k}.$$ (2)

Plugging the degree distribution into (2) completes the derivation; e.g., for SF networks we
substitute $\mathcal{P}(k) = k^{-\gamma} / \sum_{k' \ge m} k'^{-\gamma}$. A comparison of Eq. (2) with
simulations yields a perfect agreement (Fig. 2).
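
Equation (2) is straightforward to evaluate numerically. The sketch below is an illustration under stated assumptions: the degree distribution is a pure power law normalized on $m \le k \le k_{\max}$, and the cutoff `k_max` (set here to 999, i.e. $N - 1$ for $N = 1000$) is a choice made for this example rather than a value taken from the text.

```python
def mean_basin_density(gamma, m=1, k_max=999):
    """Evaluate Eq. (2) for P(k) ~ k**(-gamma), m <= k <= k_max."""
    ks = range(m, k_max + 1)
    norm = sum(k ** (-gamma) for k in ks)
    p = {k: k ** (-gamma) / norm for k in ks}
    mean_k = sum(k * p[k] for k in ks)
    n_b, cumulative = 0.0, 0.0
    for k in ks:
        cumulative += k * p[k] / mean_k   # Pr{neighbor degree <= k}
        n_b += p[k] * cumulative ** k     # P(k) * Pr{peak | degree k}
    return n_b
```

With these assumptions the density grows with the degree exponent, from a small fraction of the nodes near $\gamma = 2$ to a large majority of the nodes at $\gamma = 5$, in line with the qualitative picture of few basins for broad distributions and many for narrow ones.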

Many real-life networks [36], and in particular growing ones, have $\gamma$ close to 2 and
accordingly, a logarithmically diverging average degree $\langle k \rangle \sim \ln N$.
Consequently, the $k = m$ term dominates Eq. (2):

$$\langle n_b \rangle \simeq \mathcal{P}(m) \left[ m \mathcal{P}(m) \right]^{m} (\ln N)^{-m} + O\left[ (\ln N)^{-(m+1)} \right],$$ (3)

and we expect $\langle n_b \rangle \sim [\log N]^{-m}$ for $N \to \infty$ (such that the finite value
of $\langle n_b \rangle$ at $\gamma = 2$ for $m = 1$ in Fig. 2 is a finite size effect).

The calculation of the variance of $N_b$ is more involved since the joint probabilities for
multiple peaks are not independent. An approximate expression is given in Appendix B and is
plotted in Fig. 10. Rather than the full distribution $Q(s)$, we focus on solitary basins (of
size $s = 1$), which account for the bulk of basins. In Appendix B we derive an approximation
for $Q(1)$ which is extremely close to simulation results (see Fig. 2).

B. Hierarchical networks

Deterministic hierarchical scale-free networks provide a unique opportunity for an
analytical treatment of networks with broad degree distribution [28, 29]. In the following
we derive analytical results for the basins topology, which reproduce to some extent the
results for random SF networks. In particular, hierarchical networks have a giant basin for
small $\gamma$, and a power-law distribution of the basin sizes $P(s)$ for large $\gamma$, just
as was found in Section III for the random networks.

Hierarchical scale-free networks [28, 29] are constructed in a recursive fashion: in
$(u, v)$-flowers, each link in generation $n$ is replaced by two parallel paths consisting of
$u$ and $v$ links, to yield generation $n + 1$ (Fig. 6); and in $(u, v)$-trees, defined in
analogy to the flowers, we obtain generation $n + 1$ of a $(u, v)$-tree by replacing every link
in generation $n$ with a chain of $u$ links, and attaching to each of its endpoints chains of
$v/2$ links (assuming $v$ is even). A natural choice for the genus of flowers in generation
$n = 1$ is a cycle graph (a ring) consisting of $u + v = w$ links and nodes. $(u, v)$-flowers
and trees were shown to have degree distribution of the form
$\mathcal{P}(k) \sim k^{-\gamma}$, with $\gamma = 1 + \ln(u+v)/\ln 2$, and are thus scale-free.
Considering shortest paths, $(u, v)$-nets with $u = 1$ are small-worlds and are otherwise
fractals [29]. These and other topological properties, such as clustering and degree-degree
correlations, make them suitable models for real-life complex networks.
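
The flower construction can be sketched in a few lines. This is an illustrative implementation of the recursive link replacement only (trees are not covered), assuming $u + v \ge 3$ so that generation 1 is a proper ring; the function name and the edge-list representation are choices made here.

```python
def uv_flower(u, v, generations):
    """Return (number of nodes, edge list) of a (u, v)-flower."""
    w = u + v
    # Generation 1: a ring of w nodes and w links.
    edges = [(i, (i + 1) % w) for i in range(w)]
    n_nodes = w
    for _generation in range(generations - 1):
        new_edges = []
        for a, b in edges:
            for length in (u, v):
                # Replace the link a-b by a path of `length` links,
                # creating length - 1 new intermediate nodes.
                prev = a
                for _step in range(length - 1):
                    new_edges.append((prev, n_nodes))
                    prev = n_nodes
                    n_nodes += 1
                new_edges.append((prev, b))
        edges = new_edges
    return n_nodes, edges
```

Each generation multiplies the number of links by $w$ and doubles the degree of every existing node, which is the origin of the power-law degree distribution.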

We have derived the complete distribution of P(s) for
all $(u, v)$-flowers and trees. This is a tedious exercise
in real-space renormalization (verified numerically on a 
computer) that adds little physical insight. We thus limit 
the discussion to the results themselves. 

For (1,2)- and (1,3)-flowers, and (1,2)-trees, all nodes are evenly split between the $w$
basins peaked in the nodes forming the $n = 1$ generation. Thus, this case corresponds to the
small $\gamma$ limit of random SF networks, where a giant basin attracts all nodes.

In $(1, v)$-flowers with $v \ge 4$ (which corresponds to $\gamma > 3$), basins of size
$b_m \sim 4^m$ ($m = 0, 1, \ldots, n-2$) appear $(v - 3) w^{n-m-1}$ times. Thus, basins of size
$s \sim 4^m$ occur with frequency $\sim w^{-m} = s^{-\ln w/\ln 4}$. Because the possible basin
sizes are not continuous but are exponentially spaced, this leads to a power-law
distribution $P(s) \sim s^{-(\alpha+1)}$, with basin exponent
$\alpha = \ln w/\ln 4 = (\gamma - 1)/2$.

In $(1, v)$-trees the situation is qualitatively similar, but more subtle, with different
results for $v = 4$ and $v > 4$. For (1,4)-trees, we find that basins of size $b_m$
($m = 2, 3, \ldots, n-1$) appear $2 w^{n-m-1}$ times. Here $b_m \sim A r^m$ ($m \gg 1$), where $r$
is the larger root of $r^2 - 5r + 2 = 0$, $r = (5 + \sqrt{17})/2 \approx 4.56$. Thus,
$P(s) \sim s^{-(\alpha+1)}$ with $\alpha = \ln 5/\ln r \approx 1.06$. For $(1, v)$-trees with
$v > 4$, basins of size $b_m \sim 4^m$ ($m = 2, 3, \ldots, n-1$) appear $2 w^{n-m}$ times, so
$P(s) \sim s^{-(\alpha+1)}$ with $\alpha = \ln w/\ln 4$, as in $(1, v)$-flowers. The size of the
largest basin for all $(1, v)$-nets (except (1,4)-trees) is $S \sim N^{\delta}$ with
$\delta = 2/(\gamma - 1)$.



FIG. 6: Hierarchical scale-free $(u, v)$-flowers. Shown are two examples of networks with
degree exponent $\gamma = 1 + \ln(u + v)/\ln 2 = 3$, with (a) $u = 1$, $v = 3$, and (b) $u = 2$,
$v = 2$. In both cases the top of the figure illustrates the edge replacement scheme, to two
parallel paths of $u$ and $v$ edges, while the bottom of the figure shows flowers obtained in
this way to generations $n = 1$, 2, and 3.



In $(u, v)$-nets with $u \ge 2$ and $v \ge 2$ the number of basins of size $2^m + 1$
($1.5 \cdot 2^m + 1$ for $(u, 4)$-trees), $m = 3, 4, \ldots, n-1$, is $(w - 2) w^{n-m}$ for flowers
and $[w - 3 + 2/w]\, w^{n-m}$ for trees. This simply leads to $P(s) \sim s^{-\gamma}$, or
$\alpha = \gamma - 1$. Essentially, due to the strong disassortative nature of $(u, v)$-nets
with $u \ge 2$ and $v \ge 2$, the basins in these networks typically consist of a peak and its
immediate neighbors, so the basin sizes mirror the degree distribution. Indeed, the size of
the largest basin, $S \sim N^{1/(\gamma-1)}$, has the same scaling as the largest degree.

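To summarize, with the exception of (1,4)-trees:

$$u = 1: \quad \begin{cases} \alpha = (\gamma - 1)/2, & \gamma > 3, \\ w \text{ giant basins}, & \gamma \le 3, \end{cases}$$ (4)

$$u \ge 2: \quad \alpha = \gamma - 1.$$ (5)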


Key features revealed by this analysis compare favorably with the results in random
scale-free networks. The giant basins found for hierarchical nets with $\gamma < 3$ parallel
the low-$\gamma$ phase found in random nets. The power-law decay found for $\gamma > 3$ agrees
with the findings for large $\gamma$ in random nets, as does the increase of $\alpha$ with
increasing $\gamma$.

V. RANDOM SURFACES

The decomposition of a network into degree-based basins of attraction is a special case of a
general problem of finding the basins when the attractiveness of a node is determined by a
certain attribute. The association of a scalar field with the network nodes and the
emergence of a "gradient network" were suggested in [18, 19] and discussed in Section I.
Here, our main interest is in the basins of attraction induced by the external field. In
particular, determining the attractiveness of a node by an external parameter allows the
basins of attraction to be defined in regular networks or lattices where all sites have the
same degree. As a basic example, we discuss one- and two-dimensional lattices where each
node is assigned a random height (or potential energy, density, etc.). The understanding of
the topology of such random surfaces is of much importance [40, 41]. For example, the number
of peaks determines the number of possible non-satisfied bonds in a spin glass or the "roof"
of the surface in ballistic growth models [44].

The height $h_i$ of lattice site $i$ is taken from some distribution (independently of the
other lattice sites). Without loss of generality, one may assume the distribution is
uniform, in the interval [0, 1]. Nodes are attracted to their shortest neighbor, so that the
surface is energy-like (Fig. 7). The topology of the basins, in this case, has a clear
physical interpretation: Put a particle in each node of the lattice and let the particles
follow paths of steepest descent. When the system stops evolving, the number of particles
$s$ in each minimum is the size of its basin of attraction.
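
The particle picture can be sketched for a one-dimensional ring as follows. The code is an illustration, assuming periodic boundary conditions and distinct heights (which holds with probability one for continuous distributions); the function name is hypothetical.

```python
def chain_basins(heights):
    """Basin size of every valley on a ring of sites with given heights."""
    n = len(heights)

    def target(i):
        # Site that i is attracted to: its shorter neighbor if that
        # neighbor is shorter than i itself, otherwise i (a valley).
        left, right = (i - 1) % n, (i + 1) % n
        low = left if heights[left] < heights[right] else right
        return low if heights[low] < heights[i] else i

    sizes = {}
    for i in range(n):
        while target(i) != i:   # follow steepest descent to a valley
            i = target(i)
        sizes[i] = sizes.get(i, 0) + 1
    return sizes
```

For the heights `[0.5, 0.1, 0.7, 0.9, 0.3, 0.8]`, the valleys are sites 1 and 4, and each collects three particles.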




FIG. 7: (Color online) Schematic representation of a random 
surface in one dimension. Each node is attracted to its short- 
est neighbor (with periodic boundary conditions). The four 
highlighted nodes are valleys — their neighbors are taller. All 
nodes attracted to a valley belong to its basin of attraction. 
The four basins in the drawing are shown in different back- 
ground colors. 

In one dimension, each point on the surface is either a local maximum (peak), a local
minimum (valley), or it has one taller and one shorter neighbor. To find the density of
peaks we look at any three consecutive heights $(h_1, h_2, h_3)$ and notice that the
probability that the middle one, $h_2$, is maximal is 1/3. Similarly, the density of valleys
is also 1/3. The variance in the number of peaks/valleys can be derived following similar
steps as for networks (Appendix B) and turns out to be $2N/45$ [13].

Let us calculate the probability of a node to be a valley of a basin of size $s$,
$R(s) = \langle n_b \rangle P(s)$. The minimal size $s = 1$ of the basin is obtained in the
situation when the minimum is surrounded by two taller heights whose other adjacent heights
are shorter than the minimum. If $h$ is the height of the minimum, the above situation occurs
with probability $[h(1 - h)]^2$. Integrating over $h$ we find the density of smallest basins

$$R(1) = \int_0^1 dh\, h^2 (1 - h)^2 = \frac{1}{30}.$$ (6)

For $s \ge 2$, the density of basins of attraction of size $s$ is given by

$$R(s) = \frac{2^{s+3}}{(s+4)!}\, s(s+3) - \frac{4(s^2 + 3s + 1)}{(s+3)!}.$$ (7)

The derivation of this result is presented in Appendix C. One can verify the validity of
both the normalization requirement and the density of valleys:

$$\sum_{s \ge 1} s R(s) = 1, \qquad \sum_{s \ge 1} R(s) = \frac{1}{3} = \langle n_b \rangle.$$

For large $s$, $R(s) \sim 1/s!$, which decays much faster than the power-law decay observed
for networks.
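
The two sum rules can be checked with exact rational arithmetic. The sketch below assumes that Eq. (7) reads $R(s) = 2^{s+3} s(s+3)/(s+4)! - 4(s^2+3s+1)/(s+3)!$ for $s \ge 2$ and that $R(1) = 1/30$; the sums are truncated at $s = 59$, which is an arbitrary cutoff made harmless by the factorial decay.

```python
from fractions import Fraction
from math import factorial


def basin_density_1d(s):
    """R(s) for a 1D chain: Eq. (6) for s = 1, Eq. (7) for s >= 2."""
    if s == 1:
        return Fraction(1, 30)
    return (Fraction(2 ** (s + 3) * s * (s + 3), factorial(s + 4))
            - Fraction(4 * (s * s + 3 * s + 1), factorial(s + 3)))


# Truncated sums; the neglected tail is far below double precision.
total_valleys = sum(basin_density_1d(s) for s in range(1, 60))
total_nodes = sum(s * basin_density_1d(s) for s in range(1, 60))
```

Under these assumptions `total_valleys` agrees with 1/3 and `total_nodes` with 1 to within floating-point precision, and the first nontrivial value is $R(2) = 7/90$.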

In two dimensions, basins of attraction are similarly defined as the set of all nodes which
are attracted to a given valley. We limit ourselves to the analytical computation of
$R(s = 1)$, as larger basins of attraction seem to require very tedious calculations. Let $h$
be the height of the minimum of an $s = 1$ basin of attraction. The adjacent four heights
must be taller, which happens with probability $(1 - h)^4$. We write

$$R(1) = \int_0^1 dh\, (1 - h)^4 \sigma(h),$$ (8)

and the chief problem is to determine the probability $\sigma(h)$ that for each of the 4
adjacent sites there is a neighbor which is shorter than $h$. Let $\{x_i\}$ be the heights of
diagonal sites $(\pm 1, \pm 1)$, and $\{y_j\}$ the heights of the sites $(0, \pm 2)$ and
$(\pm 2, 0)$ [we set the minimum at the origin]. The probability $\sigma(h)$ is given by

$$\sigma(h) = (1 - h)^4 h^4 + 4h(1 - h)^3 h^2 + 2h^2(1 - h)^2 + 4h^2(1 - h)^2 h + 4h^3(1 - h) + h^4.$$

Indeed, one possibility is that all the $x_i$ exceed $h$, and then all the $y_j$ must be shorter
than $h$. This happens with probability $(1 - h)^4 h^4$. If exactly three of the $x_i$ are taller
than $h$, there should be exactly two $y_j$ that are shorter than $h$. This explains the term
$4h(1 - h)^3 h^2$. For the case that two of the $x_i$ are taller and two are shorter than $h$,
consideration of their exact locations leads to the term
$2h^2(1 - h)^2 + 4h^2(1 - h)^2 h$. Finally, when at most one $x_i$ is taller than $h$, there is
no requirement on the $y_j$. Performing the integral in Eq. (8) we obtain

$$R(1) = \frac{109}{4290}.$$ (9)
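
The integral in Eq. (8) is a sum of Beta integrals and can be evaluated exactly. The sketch below assumes the six-term form of $\sigma(h)$ quoted above and lists each term of $(1-h)^4 \sigma(h)$ as a monomial in $h$ and $1 - h$; the variable names are illustrative.

```python
from fractions import Fraction
from math import factorial


def beta_integral(a, b):
    """Exact value of the integral of h**a * (1-h)**b over [0, 1]."""
    return Fraction(factorial(a) * factorial(b), factorial(a + b + 1))


# Terms of (1-h)**4 * sigma(h) as (coefficient, power of h, power of 1-h).
terms = [(1, 4, 8), (4, 3, 7), (2, 2, 6), (4, 3, 6), (4, 3, 5), (1, 4, 4)]
r1_2d = sum(c * beta_integral(a, b) for c, a, b in terms)
```

The six contributions are 1/6435, 1/330, 1/126, 1/210, 1/126 and 1/630, which add up to 109/4290.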



In the infinite dimensional case, the random surface is defined on top of a network, as in
gradient networks [18, 19]. The only quantity that seems easily calculable is the average
number of basins $\langle n_b \rangle$: the probability of a node of degree $k$ to be a valley,
for randomly distributed heights, is simply $1/(k + 1)$. Thus, for a network,

$$\langle n_b \rangle = \sum_{k} \frac{\mathcal{P}(k)}{k + 1}.$$ (10)
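
Equation (10) amounts to averaging $1/(k+1)$ over the nodes. A minimal sketch, assuming the network is summarized by its list of node degrees (the function name is illustrative):

```python
from fractions import Fraction


def valley_density(degrees):
    """Average of 1/(k+1) over all nodes, i.e. Eq. (10) for a given network."""
    return sum(Fraction(1, k + 1) for k in degrees) / len(degrees)
```

For a ring, where every degree is 2, this gives 1/3, consistent with the density of valleys found above for one-dimensional chains; for a square lattice with degree 4 it gives 1/5.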




FIG. 8: Illustration of the difference between basins of attrac- 
tion and network communities. In the plotted toy network, 
the six nodes in the middle are fully connected and clearly 
form a single community. However, the three nodes on the 
left side are attracted to the left hub, whereas the three nodes 
on the right side are attracted to the right hub. Thus, they 
are split between two different basins of attraction (indicated 
with different background colors and different orientation of 
node fill patterns). 



VI. SUMMARY AND DISCUSSION 

In summary, we have introduced a process of steepest ascent that partitions complex
networks into basins of attraction: subsets of nodes that are attracted to the same peak,
the node of highest degree in the basin. For random scale-free networks we find a transition
between networks dominated by a giant basin comprising the majority of the nodes, for
$\gamma < \gamma_c$, to numerous, fragmented basins, for $\gamma > \gamma_c$. We find
numerically that $\gamma_c \approx 2.8$, while theoretical arguments indicate that for
$N \to \infty$, $\gamma_c = 3$. Both above and below the transition point, the distribution of
finite basins has a power-law tail $s^{-(\alpha+1)}$, where $\alpha$, the basin exponent,
exhibits a non-trivial dependence upon the degree exponent $\gamma$. An exact analysis of
deterministic hierarchical scale-free nets exhibits some of these features.

A comprehensive description of the complete distribution of basin sizes for static random
scale-free networks remains a challenge. Furthermore, other types of networks might exhibit
a different basin topology. In particular, randomly growing networks [45], Erdős-Rényi
networks [46], and networks with correlations (for example, degree-degree correlations) are
of interest and are left for future study.

In a sense, associating each node with a hub and identifying basins of attraction provides a
partition of the network into communities. Numerous algorithms have been proposed to address
the problem of classifying nodes into communities. Interestingly, different algorithms
employ highly diverse methods and transformations, or measures, of the network topology. For
example, many algorithms maximize the modularity index by a wide spectrum of optimization
techniques [10, 11, 13, 15]. Others exploit quantities such as betweenness centrality, traces
of random walks, eigenvectors of the network Laplacian, electrical conductance, and others.
While some algorithms recursively split the network into communities separated by "weak
links", others take the bottom-up approach and recursively merge highly similar communities,
based on various similarity indices. Also, while many algorithms output a dendrogram (a
tree) with a partition of the network into disjoint communities at all possible levels of
resolution, other studies provide an overlapping community structure; for example, based on
identification of almost complete subgraphs [12] or mapping to magnetic domains [8].

How does the partition into basins of attraction compare with other community detectors?
First, most algorithms are global, since they utilize as much information as possible about
the network topology to improve the identification of the communities. In contrast, a few
other methods, as well as our basins of attraction, are computed in a local manner: each
node is assigned to a community based only on its immediate neighborhood. Second, and more
important, the goal of most community detectors is to find a partition that maximizes
intra-community proximity and inter-community separation. That usually takes the form of
maximizing the number of links within a community while minimizing the number of links
between communities. As opposed to that, our partition into basins of attraction addresses a
different question: which nodes are affiliated with the same hub? While in many cases this
attribute is correlated with community structure, this is not always the case, as we
demonstrate in Figure 8.

A possible outcome of our analysis is revealed when we test two real-life networks for which
the problem of basins is of practical importance: The Internet at the Autonomous Systems
(AS) level as of 2007, and the energy landscape's network of Lennard-Jones clusters [25].
Both networks are scale-free, with $\gamma = 2.5$ and $2.9$, respectively. In both networks
there is a giant basin which attracts most nodes (with Verizon's AS being the peak in the
Internet), and a few tiny basins, in agreement with the theoretical results for the model
scale-free networks. In the Lennard-Jones network, an uphill walk in the degree space, which
can be mapped in general onto a downhill walk in the energy landscape, will end up at the
node of highest degree, which can be interpreted as the ideal glass state [26]. Note the
different situation for the energy landscape of proteins, where the energy increases with
the degree, such that the system is expected to follow a downhill walk in the degree space.

For the Internet, the existence of a giant basin implies 
that a routing scheme that forwards all messages in a 
steepest ascent manner will quickly arrive at the hub. 
From the hub, messages could be routed to their target 
according to a predefined target-specific sequence embedded 
in the packet, as was previously suggested [23l. [5l|. 
This leads to an efficient routing scheme which requires 
practically no knowledge of the network topology at the 
nodes, and is thus highly scalable. An obvious drawback 
of such a scheme is the congestion generated at the 
hubs, which is eliminated in other methods (for example, 
by routing through shortest paths when the hubs 
are avoided [52], or by walking down the congestion 
gradient 22]). Therefore, the steepest ascent search might 
not be of immediate applicability to the Internet itself, 
but it is of interest in other newly designed communication 
networks where the hubs can carry high load. 
In this context, we note the interesting fact that Boguna 
et al. m also find a transition between navigable and 
non-navigable network topology at $\gamma \approx 2.6$, although in 
their case the navigation is based on minimizing distances 
within a hidden metric space. 

Our partitioning has another potential practical application: 
locating the node of highest degree in various 
search scenarios. A local search starting from a single 
node and following a steepest ascent would always be 
successful in networks with a single basin of attraction, 
as in scale-free networks with $\gamma < \gamma_c$. With more than 
one basin, a strategy could be devised for starting from 
a number of randomly selected nodes to find the highest 
degree with a prescribed rate of success. 
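
The multi-start strategy described above can be sketched in a few lines of Python. This is only an illustrative sketch: the adjacency-dictionary representation, the function names `steepest_ascent` and `find_hub`, and the deterministic choice among equal-degree neighbors are assumptions made here, not part of the original analysis.

```python
import random

def steepest_ascent(adj, start):
    """Walk to the neighbor of highest degree while the degree strictly increases.

    adj is an assumed representation: a dict mapping each node to a list of
    its neighbors. Returns the peak reached from `start`.
    """
    node = start
    while True:
        # Neighbor of maximal degree (the first one found if several tie).
        best = max(adj[node], key=lambda v: len(adj[v]), default=node)
        if len(adj[best]) <= len(adj[node]):
            return node  # no neighbor of higher degree: node is a peak
        node = best

def find_hub(adj, n_starts, rng=random):
    """Return the highest-degree peak reached from n_starts random start nodes."""
    starts = rng.sample(sorted(adj), n_starts)
    peaks = {steepest_ascent(adj, s) for s in starts}
    return max(peaks, key=lambda v: len(adj[v]))

# Toy network with two basins: hub 0 (degree 4) and a smaller peak 6 (degree 3).
adj = {0: [1, 2, 3, 4], 1: [0], 2: [0], 3: [0], 4: [0, 5],
       5: [4, 6], 6: [5, 7, 8], 7: [6], 8: [6]}
print(steepest_ascent(adj, 5))   # walk starting in the smaller basin
print(find_hub(adj, 3, random.Random(0)))
```

With a single start the search may end at the smaller peak; increasing `n_starts` raises the chance that at least one walk begins in the basin of the true hub.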

A concrete example for such an application is routing 
in wireless sensor networks. A wireless sensor network 
is a system consisting of spatially distributed 
autonomous devices using sensors to cooperatively monitor 
physical or environmental conditions. In a typical sensor 
network, one distinguished node serves as a gateway 
between the sensors and the end users, and must collect 
data from the nodes. Since energy is usually a very 
scarce resource at the nodes, an efficient protocol must be 
designed to transmit the measured data to the base station. 
Thus, our steepest ascent protocol, in which each 
node sends out data to its neighbor with highest degree, 
is of interest. This protocol is expected to be relevant 
in heterogeneous sensor networks, in which the communication 
range varies between the nodes [55]. Indeed, we 
found (data not shown) that for a power-law distribution 
of communication ranges, decaying as $r^{-\epsilon}$, there exists a 
regime in the $(\epsilon, \langle r \rangle)$ parameter space for which the network 
collapses into a single basin, making the steepest ascent 
protocol highly efficient. 

APPENDIX A: ALTERNATIVE DEFINITION OF 
BASINS 

When the weights of the nodes are taken from a dis- 
crete distribution, as in the case where the weight is the 
degree of the node, neighboring nodes may have the same 
weight. A method is then required to break the tie. In 
Section II we presented an algorithm that overcomes this 
difficulty, which we term the local search algorithm. The 
following recursive search algorithm works as well. 

Suppose the search starts at node $i$, and let $j_{\max}$ be 
the neighbor(s) of $i$ of highest degree $k_{j_{\max}}$. Denote the 
number of neighbors with degree $k_{j_{\max}}$ as $q$. 

1. If $k_i > k_{j_{\max}}$, $i$ is a peak. 

2. If $k_i < k_{j_{\max}}$, $i$ is attracted to $j_{\max}$. (If there is more 
than one neighbor with degree $k_{j_{\max}}$ (i.e., $q > 1$), 
select one randomly.) 

3. If $k_i = k_{j_{\max}}$, mark $i$ as visited and look for the 
attractor of $j_{\max}$, recursively, among unvisited nodes. 
If $q > 1$, look also for the attractors of all other 
neighbors of $i$ with degree $k_{j_{\max}}$. Keep only the 
attractor of highest degree among the $q$ attractors. 

4. If the degree of the attractor of $j_{\max}$ is larger than 
$k_i$, $i$ is attracted to $j_{\max}$. If the degree of the 
attractor of $j_{\max}$ equals $k_i$, $i$ is a peak. 

In other words, in a search for a peak strictly higher 
than its neighbors, we are allowed to surf over "ridges" 
of connected nodes of equal degree, until either reaching 
a peak or a dead end. 
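
One possible reading of the ridge-surfing idea can be sketched in Python as follows. This is a simplified sketch and not necessarily the procedure used for the results reported here: the function name `ridge_search`, the adjacency-dictionary representation, the breadth-first exploration of the ridge, and the deterministic tie-breaking (the text selects randomly, and keeps the attractor of highest degree among the $q$ candidates) are all assumptions made for illustration.

```python
from collections import deque

def ridge_search(adj, i):
    """Return the neighbor that node i is attracted to, or i itself if i is a peak.

    adj is an assumed representation: a dict mapping each node to a list of
    its neighbors; the degree of a node is the length of that list.
    """
    deg = lambda v: len(adj[v])
    if not adj[i]:
        return i                       # isolated node: trivially a peak
    k_i = deg(i)
    k_max = max(deg(j) for j in adj[i])
    if k_i > k_max:
        return i                       # step 1: strictly above all neighbors
    tops = [j for j in adj[i] if deg(j) == k_max]
    if k_i < k_max:
        return tops[0]                 # step 2 (first top neighbor, not random)
    # Step 3: tie. Surf the ridge of nodes with degree k_i, never revisiting.
    visited = {i, *tops}
    first_hop = {j: j for j in tops}   # which neighbor of i each ridge node came from
    queue = deque(tops)
    while queue:
        v = queue.popleft()
        if any(deg(u) > k_i for u in adj[v]):
            return first_hop[v]        # step 4: the ridge leads uphill
        for u in adj[v]:
            if deg(u) == k_i and u not in visited:
                visited.add(u)
                first_hop[u] = first_hop[v]
                queue.append(u)
    return i                           # step 4: dead end, so i is a peak

# Nodes 0 and 1 form a ridge of degree 2 that leads to node 2 (degree 3);
# nodes 6, 7, 8 form a closed ridge (a triangle) with no uphill exit.
adj = {0: [1, 5], 1: [0, 2], 2: [1, 3, 4], 3: [2], 4: [2], 5: [0],
       6: [7, 8], 7: [6, 8], 8: [6, 7]}
print(ridge_search(adj, 0), ridge_search(adj, 6))
```

In this toy network node 0 has no neighbor of strictly higher degree, so a purely local search would stop there, whereas the ridge search forwards it to node 1 on the way to node 2.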

Despite the broad distribution of degrees in SF networks, 
the majority of the nodes have the minimal degree $m$, 
or a degree close to $m$. Thus, one may expect many 
ridges to form and, as a result, a different basin count, 
depending on whether the local or recursive search is 
employed. For example, in the hierarchical networks 
studied in Section [51 a recursive search yields 
a single giant basin for all (l,w)-nets. In random SF 
networks with large $\gamma$ the recursive search method also 
yields fewer basins (Fig. 9(a)), which is explained by the 
prevalence of ridges, in this case, due to the high density 
of small-degree nodes. However, broader properties 
of the basins topology remain unaffected by the search 
algorithm: $Q(s)$ is practically the same, for large $s$, as is 
also the basin exponent $a$, extracted from either method 
(Fig. 9(b)). 

FIG. 9: Basin topology with local vs. recursive search. 
(a) Density of basins, and (b) the basin exponent $a$ as a 
function of $\gamma$, in SF networks with $N = 1000$ and $m = 1$. 



APPENDIX B: PROPERTIES OF THE NUMBER 
OF BASINS IN SF NETWORKS 

In this appendix, we calculate two quantities related 
to the number of basins. 



1. Variance of the number of basins 

Denote by $A_i$ the indicator of the event that node $i$ is a 
peak, such that $N_b = \sum_{i=1}^{N} A_i$. To compute the variance, 
we shall use the general formula 

$$\mathrm{Var}(N_b) = \sum_{i=1}^{N} \mathrm{Var}(A_i) + 2\sum_{i=1}^{N}\sum_{j>i} \mathrm{Cov}(A_i, A_j). \qquad \text{(B1)}$$

The first term on the right-hand side is easy to compute: 

$$\sum_{i=1}^{N} \mathrm{Var}(A_i) = \sum_{i=1}^{N} P\{A_i\}\left(1 - P\{A_i\}\right) = N \langle n_b \rangle \left(1 - \langle n_b \rangle\right).$$

Here $P\{A_i\}$ is the probability for $A_i$ to occur. For the 
second term, we get 

$$\mathrm{Cov}(A_i, A_j) = \langle A_i A_j \rangle - \langle A_i \rangle \langle A_j \rangle = P\{A_i A_j\} - P\{A_i\} P\{A_j\}.$$

Let nodes $i$ and $j$ have degrees $k_i$ and $k_j$, respectively. 
What is the probability for both nodes $i$ and $j$ to be 
peaks? We condition this probability on whether $i$ and 
$j$ are connected, an event of probability $k_i k_j/(N \langle k \rangle)$ [35]. 
If they are connected, and they have different degrees, clearly 
only one of them can serve as a peak, so only the case when 
both have the same degree $k$ contributes to the covariance. 
Also, we have to take into account that $i$ and $j$ 
might share common neighbors. Thus, the probability 
of $i$ to be a peak is enhanced if $j$ is known to be one. 
We make the approximation that the number of common 
neighbors $c$ is fixed once $k_i$ and $k_j$ are given, and is 
given by: 

$$c_{k_i,k_j} = (N-2) \sum_{k=m}^{\infty} P(k)\, \frac{k_i k}{N \langle k \rangle}\, \frac{k_j k}{N \langle k \rangle} \approx \frac{k_i k_j \langle k^2 \rangle}{N \langle k \rangle^2},$$

since this is the probability, summing over all possible 
degrees of a third node, that it is adjacent to both 
$i$ and $j$. For both $i$ and $j$ to be peaks, if $k_i < k_j$, $k_i$ 
nodes need to have degree less than $k_i$, but only $k_j - c$ 
nodes need to have degree less than $k_j$ (since the $c$ 
common nodes are guaranteed to have degree less than 
$k_i < k_j$), and vice versa if $k_i > k_j$. Approximating the 
probabilities for two nodes without common neighbors to 
be peaks as independent, we get 

$$\begin{aligned}
\mathrm{Cov}(A_i, A_j) ={}& \sum_{k=m}^{\infty} \frac{k^2}{N \langle k \rangle}\, [P(k)]^2\, [f(k)]^{2k - c_{k,k}} \\
&+ 2 \sum_{k_1=m}^{\infty} \sum_{k_2 > k_1} P(k_1) P(k_2) \left(1 - \frac{k_1 k_2}{N \langle k \rangle}\right) [f(k_1)]^{k_1}\, [f(k_2)]^{k_2 - c_{k_1,k_2}} \\
&+ \sum_{k=m}^{\infty} [P(k)]^2 \left(1 - \frac{k^2}{N \langle k \rangle}\right) [f(k)]^{2k - c_{k,k}} \\
&- P\{A_i\}^2, \qquad \text{(B2)}
\end{aligned}$$

where $f(k) = \sum_{k'=m}^{k} k' P(k') / \langle k \rangle$ is the probability for a 
neighbor to have degree no larger than $k$. The first term 
corresponds to the case where the nodes are directly connected 
and have identical degree; the second term is the 
case when they are not directly connected, and have different 
degrees; in the third term they are not directly 
connected but have equal degree; and the last term is 
just $P\{A_i\} P\{A_j\} = P\{A_i\}^2$. This formula is compared 
to simulations in Fig. 10 to find a qualitative agreement. 
We also plot the $\langle n_b \rangle (1 - \langle n_b \rangle)$ term alone, neglecting 
the covariance, and find that it is a good approximation 
for the case of large $\gamma$. 
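
As a rough numerical companion to the uncorrelated term, the sketch below evaluates $f(k)$, the peak probability $\sum_k P(k)[f(k)]^k$ that appears as $P\{A_i\}$ in the independent-neighbor approximation, and the resulting $N \langle n_b \rangle (1 - \langle n_b \rangle)$. The function names, the dictionary representation of $P(k)$, and the hard cutoff `k_max` on the power law are assumptions made for illustration only.

```python
def power_law(gamma, m=1, k_max=1000):
    """Normalized P(k) proportional to k**(-gamma) on m <= k <= k_max (assumed cutoff)."""
    weights = {k: k ** -gamma for k in range(m, k_max + 1)}
    norm = sum(weights.values())
    return {k: w / norm for k, w in weights.items()}

def peak_density(pk):
    """Return sum_k P(k) * f(k)**k, where f(k) = sum_{k'<=k} k' P(k') / <k>."""
    mean_k = sum(k * p for k, p in pk.items())
    n_b, f_k = 0.0, 0.0
    for k in sorted(pk):
        f_k += k * pk[k] / mean_k      # running sum gives f(k)
        n_b += pk[k] * f_k ** k        # all k neighbors have degree <= k
    return n_b

def uncorrelated_variance(pk, n):
    """First term of (B1): N <n_b> (1 - <n_b>), covariance neglected."""
    n_b = peak_density(pk)
    return n * n_b * (1.0 - n_b)

for gamma in (2.5, 3.0, 4.0):
    pk = power_law(gamma)
    print(gamma, peak_density(pk), uncorrelated_variance(pk, 1000))
```

The covariance contribution of (B2) is deliberately left out here; the sketch only covers the term that the text reports as a good approximation at large exponent.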



FIG. 10: Variance in number of basins for SF networks with 
$N = 1000$ and $m = 1$. Symbols represent simulation results, 
and the solid line corresponds to Eq. (B2). The dashed line is 
obtained by neglecting the covariance term and has the form 
$\langle n_b \rangle (1 - \langle n_b \rangle)$ (where $\langle n_b \rangle$ is calculated from Eq. (2)). 



2. Density of basins of size one 

Consider a given node $i$ with degree $k$ and take one 
of its neighbors $j$; suppose this neighbor $j$ has degree $k'$. 
For $i$ to be a peak, $k'$ must be less than or equal to $k$. 
For $i$ to form a basin of size one, $j$ must have at least one 
neighbor of degree $k + 1$ or above, in order to be attracted 
to that neighbor and not to $i$. If we assume that at least 
up to the second shell, $i$ is a root of a tree, we have: 

$$Q(1) = \sum_{k=m}^{\infty} P(k) \left[ \frac{k P(k)}{\langle k \rangle} + \sum_{k'=m}^{k-1} \frac{k' P(k')}{\langle k \rangle} \left( 1 - \left[ \sum_{k''=m}^{k} \frac{k'' P(k'')}{\langle k \rangle} \right]^{k'-1} \right) \right]^{k},$$

where 

$$1 - \left[ \sum_{k''=m}^{k} \frac{k'' P(k'')}{\langle k \rangle} \right]^{k'-1}$$

is the probability that at least one of the $k' - 1$ neighbors 
(other than $i$) of $j$ has degree above $k$. We wrote a 
separate term for the case of $k' = k$, since in this case we 
are guaranteed that $j$ is not attracted to $i$, regardless of 
the degrees of the neighbors of $j$. The small correction 
due to the case when $j$ has another neighbor (other than 
$i$) of degree exactly $k$ can be calculated analytically as 
well, but was found to be negligible. For $\gamma \to 2$, when 
$\langle k \rangle \sim \log N$, only the $k = m$ term is significant, and 
thus $Q(1) \sim P(m) [m P(m)]^m (\ln N)^{-m}$, and almost all 
basins are solitary (see Eq. Q). This is confirmed in the 
simulations (Fig. 5). 
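
The expression for $Q(1)$ is straightforward to evaluate numerically. The following Python sketch does so for an arbitrary degree distribution supplied as a dictionary; the function name `solitary_basin_density` and the input format are assumptions made here for illustration, and the code simply transcribes the tree-approximation formula above.

```python
def solitary_basin_density(pk):
    """Evaluate Q(1) for a degree distribution pk given as {degree: probability}."""
    mean_k = sum(k * p for k, p in pk.items())
    degrees = sorted(pk)

    def q(k):
        # Probability that a randomly chosen neighbor has degree k.
        return k * pk[k] / mean_k

    total, f_k = 0.0, 0.0
    for k in degrees:
        f_k += q(k)                    # f(k): neighbor degree no larger than k
        bracket = q(k)                 # k' = k term: j is never attracted to i
        for kp in degrees:
            if kp >= k:
                break
            # j has degree k' < k and at least one other neighbor above k.
            bracket += q(kp) * (1.0 - f_k ** (kp - 1))
        total += pk[k] * bracket ** k  # all k neighbors of i must comply
    return total

# Two-degree toy distribution: half the nodes have degree 1, half degree 2.
print(solitary_basin_density({1: 0.5, 2: 0.5}))
```

For a distribution concentrated on a single degree every neighbor ties with the node, so the function returns 1, consistent with every node forming its own basin in that limit.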



APPENDIX C: BASINS OF ATTRACTION IN 
ONE DIMENSION 

We look at the distribution of basins of attraction in 
one-dimensional lattices. Consider first a valley separated 
by distance $i$ from the peak on the left and distance 
$j$ from the peak on the right, such that particles 
from both peaks belong to its basin of attraction. The 
probability of this is 

$$R^{++}_{ij} = \int_0^1 dh\, \Pi^{+}_i(h)\, \Pi^{+}_j(h), \qquad \text{(C1)}$$

where, e.g., $\Pi^{+}_j(h)$ is the probability that $j$ heights to the 
right of the valley of height $h$ are ascending and the last 
height is the peak which belongs to the basin of attraction 
of our valley. The probability $\Pi^{+}_k(h)$ admits an integral 
representation 

$$\Pi^{+}_k(h) = \int_{\substack{h < x_1 < \dots < x_k < 1 \\ x_{k-1} < x_{k+1} < x_k}} \prod_{a=1}^{k+1} dx_a .$$

Integrating over $x_1, \dots, x_{k-1}$ we recast the above integral 
into 

$$\Pi^{+}_k(h) = \int_{h < x_{k+1} < x_k < 1} dx_k\, dx_{k+1}\, \frac{(x_{k+1} - h)^{k-1}}{(k-1)!}$$

and the remaining integration is trivial: 

$$\Pi^{+}_k(h) = \frac{(1-h)^{k+1}}{(k+1)!}. \qquad \text{(C2)}$$

Inserting this equation into (C1) we obtain 

$$R^{++}_{ij} = \frac{1}{(i+1)!\,(j+1)!}\, \frac{1}{i+j+3}. \qquad \text{(C3)}$$

Similarly, we compute 

$$R^{+-}_{ij} = \int_0^1 dh\, \Pi^{+}_i(h)\, \Pi^{-}_j(h), \qquad \text{(C4)}$$

where $\Pi^{-}_j(h)$ is the probability that $j + 1$ heights to the 
right of the valley of height $h$ are ascending and the last 
height is the peak which belongs to the basin of attraction 
of the next valley (to its right). The probability $\Pi^{-}_k(h)$ 
can be written as 

$$\Pi^{-}_k(h) = \int_{\substack{h < x_1 < \dots < x_{k+1} < 1 \\ x_{k+1} > x_{k+2} < x_k}} \prod_{a=1}^{k+2} dx_a .$$

The two last integrations are easily performed, 

$$\Pi^{-}_k(h) = \int_{h < x_1 < \dots < x_k < 1} x_k (1 - x_k) \prod_{a=1}^{k} dx_a .$$

Integrating over $x_1, \dots, x_{k-1}$ we recast the above integral 
into 

$$\Pi^{-}_k(h) = \int_h^1 dx_k\, x_k (1 - x_k)\, \frac{(x_k - h)^{k-1}}{(k-1)!},$$

which is then computed to yield 

$$\Pi^{-}_k(h) = \frac{(1-h)^{k+1}}{(k+1)!} \left[ 1 - \frac{2(1-h)}{k+2} \right]. \qquad \text{(C5)}$$
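
Plugging (C2) and (C5) into (C4) we obtain 

$$R^{+-}_{ij} = \frac{1}{(i+1)!\,(j+1)!}\, \frac{1}{i+j+3} - \frac{1}{(i+1)!\,(j+2)!}\, \frac{2}{i+j+4}. \qquad \text{(C6)}$$

Since $R^{-+}_{ij} = R^{+-}_{ji}$, the last quantity to compute is 

$$R^{--}_{ij} = \int_0^1 dh\, \Pi^{-}_i(h)\, \Pi^{-}_j(h). \qquad \text{(C7)}$$

Using (C5) we get 

$$\begin{aligned}
R^{--}_{ij} ={}& \frac{1}{(i+1)!\,(j+1)!}\, \frac{1}{i+j+3} \\
&- \frac{1}{(i+1)!\,(j+1)!}\, \frac{2}{i+j+4} \left[ \frac{1}{i+2} + \frac{1}{j+2} \right] \\
&+ \frac{1}{(i+2)!\,(j+2)!}\, \frac{4}{i+j+5}. \qquad \text{(C8)}
\end{aligned}$$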



Equation (C3) is valid when $i \geq 1$, $j \geq 1$, and the size 
of the basin of attraction is $s = i + j + 1 \geq 3$. Overall, 
the density of basins of attraction of type $++$ of size $s$ is 

$$R^{++}(s) = \sum_{\substack{i \geq 1,\, j \geq 1 \\ i+j = s-1}} R^{++}_{ij} = \frac{2^{s+1} - 2}{(s+2)!} - \frac{2}{(s+2)\, s!}. \qquad \text{(C9)}$$



Equation (C6) is valid when $i \geq 1$, $j \geq 0$. The density 
of basins of attraction of type $+-$ of size $s$ is 

$$R^{+-}(s) = \sum_{\substack{i \geq 1,\, j \geq 0 \\ i+j = s-1}} R^{+-}_{ij} .$$

Computing the sum we find 

$$R^{+-}(s) = \frac{2^{s+1} - 2}{(s+2)!} - \frac{1}{(s+2)\, s!} - 2\, \frac{2^{s+2} - 2}{(s+3)!} + \frac{4}{(s+3)\,(s+1)!}. \qquad \text{(C10)}$$

Of course, $R^{-+}(s) = R^{+-}(s)$, and (C10) is valid when 
$s \geq 2$. 

Equation (C8) is valid when $i \geq 0$, $j \geq 0$. The density 
of basins of attraction of size $s$ and type $--$ is given by 

$$R^{--}(s) = \sum_{\substack{i \geq 0,\, j \geq 0 \\ i+j = s-1}} R^{--}_{ij} .$$

Computing the sum we find 

$$\begin{aligned}
R^{--}(s) ={}& \frac{2^{s+1} - 2}{(s+2)!} - 4\, \frac{2^{s+2} - 2}{(s+3)!} + \frac{4}{(s+3)\,(s+1)!} \\
&+ 4\, \frac{2^{s+3} - 2}{(s+4)!} - \frac{8}{(s+4)\,(s+2)!}, \qquad \text{(C11)}
\end{aligned}$$

which is valid for $s \geq 1$. Defining $R^{++}(1) = R^{++}(2) = 0$ 
and $R^{+-}(1) = R^{-+}(1) = 0$, we finally have ($s \geq 1$): 

$$R(s) = R^{++}(s) + R^{+-}(s) + R^{-+}(s) + R^{--}(s). \qquad \text{(C12)}$$



Inserting (C9), (C10), and (C11) into (C12) we arrive at 
the announced result 0. 
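
The one-dimensional result lends itself to a direct numerical check. The Python sketch below evaluates $R(s)$ exactly from the closed forms (C9) to (C12) and compares it with a simulation of a ring of sites carrying independent uniform random weights. Several choices here are assumptions made for illustration: the function names, the periodic boundary conditions, and the use of steepest ascent toward local maxima, which by the symmetry $w \to 1 - w$ has the same statistics as the valley picture used in this appendix.

```python
import random
from collections import Counter
from fractions import Fraction
from math import factorial

def exact_density(s):
    """R(s) from the closed forms (C9)-(C12), as an exact fraction."""
    a = Fraction(2 ** (s + 1) - 2, factorial(s + 2))
    b = Fraction(2 ** (s + 2) - 2, factorial(s + 3))
    c = Fraction(2 ** (s + 3) - 2, factorial(s + 4))
    d = Fraction(4, (s + 3) * factorial(s + 1))
    r_pp = a - Fraction(2, (s + 2) * factorial(s)) if s >= 3 else Fraction(0)
    r_pm = a - Fraction(1, (s + 2) * factorial(s)) - 2 * b + d if s >= 2 else Fraction(0)
    r_mm = a - 4 * b + d + 4 * c - Fraction(8, (s + 4) * factorial(s + 2))
    return r_pp + 2 * r_pm + r_mm      # R^{-+}(s) equals R^{+-}(s)

def simulated_density(n, rng):
    """Basin-size densities {s: count / n} on a ring of n random weights."""
    w = [rng.random() for _ in range(n)]
    parent = list(range(n))
    for i in range(n):
        left, right = (i - 1) % n, (i + 1) % n
        best = left if w[left] > w[right] else right
        if w[best] > w[i]:
            parent[i] = best           # attracted to the higher neighbor
    basin_size = Counter()
    for i in range(n):
        root = i
        while parent[root] != root:    # climb until a local maximum
            root = parent[root]
        basin_size[root] += 1
    histogram = Counter(basin_size.values())
    return {s: count / n for s, count in histogram.items()}

sim = simulated_density(100_000, random.Random(1))
for s in range(1, 7):
    print(s, float(exact_density(s)), sim.get(s, 0.0))
```

Two sanity checks follow from the construction: the densities sum to 1/3, the density of local extrema in a chain of independent weights, and the sizes weighted by their densities sum to one.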